Poster: stbalbach Date: Jan 5, 2013 9:43pm
Forum: texts Subject: Re: Searching book on Chinese Lattice Designs

The 'date' field for Iroha-biki Monocho is set to "1800" (see the metadata file), so it won't return hits for 2012-2013. To search on upload date, use the 'addeddate' or 'publicdate' fields. And since that particular file has no 'createdate' field in its metadata, it is correct that that search returned nothing.

The issue with Internet Archive is that metadata is not entered consistently, so the data itself is spotty. You have to search across multiple fields with "OR" between them to catch all possibilities. It gets really tricky once you get into it. I've written scripts to automatically create generic search strings that examine multiple fields, but the search string is limited to 256 chars in length (or so), so I run into hard limits with that strategy.
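
A minimal sketch of that multi-field strategy, in Python. The helper name `build_or_query`, the example field list, and the 256-character cap are assumptions for illustration (the cap is only the approximate figure mentioned above), not a documented Internet Archive API.

```python
def build_or_query(value, fields, max_len=256):
    """Return 'f1:value OR f2:value ...', one clause per field.

    max_len=256 is an assumed limit (the "256 chars or so" from the post),
    not a documented one. Raises ValueError if the query would exceed it.
    """
    query = " OR ".join(f"{field}:{value}" for field in fields)
    if len(query) > max_len:
        raise ValueError(
            f"query is {len(query)} chars, over the assumed {max_len} limit"
        )
    return query


# Try the same date range against every date-like field mentioned above.
date_range = "[2012-12-01 TO 2013-01-05]"
print(build_or_query(date_range, ["addeddate", "publicdate", "createdate"]))
```

Checking the length up front at least makes the failure visible in the script, instead of leaving you to guess why the search came back empty.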

Poster: aibek Date: Jan 5, 2013 10:19pm
Forum: texts Subject: Re: Searching book on Chinese Lattice Designs

Thanks stbalbach; your reply is helpful.

But, for at least this file, I am not able to search using the ‘addeddate’ or ‘publicdate’ either (even though both the fields are present in the metadata file).

http://archive.org/search.php?query=%28irohi%29%20AND%20addeddate%3A[2012-12-01%20TO%202013-01-05]

http://archive.org/search.php?query=%28irohi%29%20AND%20publicdate%3A[2012-12-01%20TO%202013-01-05]

Poster: philly_bob Date: Jan 6, 2013 8:16pm
Forum: texts Subject: Re: Searching book on Chinese Lattice Designs

Thanks for this discussion, folks. Helps me to get a handle on IA's search engine, which has been difficult for a non-Library Science guy like myself.

I've followed the search for the Iroha book, and I'm able to duplicate the search by fixing the Irohi/Iroha typo and entering the search into my browser bar:

http://archive.org/search.php?query=%28iroha%29%20AND%20addeddate%3A[2012-12-01%20TO%202013-01-05]

But I can't duplicate the search from the IA Advanced Search interface. I can call up the "AddedDate" field but the DateRange field only connects to the Date field. Do you have to type the [2012-12-01 TO 2013-01-05] in character-by-character?

Thanx.

Poster: stbalbach Date: Jan 6, 2013 9:35pm
Forum: texts Subject: Re: Searching book on Chinese Lattice Designs

You are right: the form applies the date range option only to the 'date' field, and I don't think a date range on other fields is possible using that form. I've always typed in searches by hand, which is more flexible; it means learning the syntax, but it's pretty powerful and works. IMO the advanced search is most useful for the part at the bottom, which creates CSV or HTML tables that can be exported to other programs, typically for creating a mass download script.

Notice that Internet Archive doesn't return an error like "Your search syntax is messed up, try again". Rather, it says "Search engine returned invalid information or was unresponsive. We are working to resolve this issue." Confusing!

Poster: aibek Date: Jan 7, 2013 4:17am
Forum: texts Subject: Re: Searching book on Chinese Lattice Designs

philly_bob,

Also note that date ranges like ‘[2012-12 TO 2013-01]’ and ‘[2012 TO 2013]’ are valid.

It may be useful to collect some sample search strings and print them out (a "cheatsheet").
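
Such a cheatsheet could be sketched in Python as below. The query strings are adapted from ones discussed in this thread; the names `CHEATSHEET` and `to_url` are made up for illustration, and the URL pattern is simply copied from the links posted here, so treat it as an assumption and not as a documented API.

```python
from urllib.parse import quote

# Sample search strings adapted from this thread (not an official list).
CHEATSHEET = {
    "date range on upload date": "(iroha) AND addeddate:[2012-12-01 TO 2013-01-05]",
    "month-precision date range": "(iroha) AND addeddate:[2012-12 TO 2013-01]",
    "numeric range": "book AND downloads:[1000 TO 2000]",
    "exclude a collection": "-collection:(usfederalcourts) AND mediatype:texts",
}


def to_url(query):
    """Percent-encode a search string and append it to the search.php URL."""
    return "http://archive.org/search.php?query=" + quote(query, safe="")


for label, query in CHEATSHEET.items():
    print(f"{label}:\n  {query}\n  {to_url(query)}")
```

Keeping the readable string next to the encoded URL makes it easier to spot typos before pasting the link into the browser bar.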

aibek

Poster: stbalbach Date: Jan 6, 2013 1:13am
Forum: texts Subject: Re: Searching book on Chinese Lattice Designs

I think it's just a misspelling of 'Iroha'

Poster: aibek Date: Jan 7, 2013 6:07am
Forum: texts Subject: Re: Searching book on Chinese Lattice Designs

Can you tell why the following do not work?

http://archive.org/search.php?query=war%20gulf^4
http://archive.org/search.php?query=buttonwood~%20AND%20sort=-reviewdate
http://archive.org/search.php?query=downloads:[10000%20TO%20null]
(all three according to the syntax on the Advanced Search page)

Why does this work:
http://archive.org/search.php?query=%28collection%3Atexts%20OR%20mediatype%3Atexts%29%20AND%20-mediatype%3Acollection&sort=-reviewdate
but not this:
http://archive.org/search.php?query=%28collection%3Atexts%20OR%20mediatype%3Atexts%29%20AND%20-mediatype%3Acollection%20AND%20sort=-reviewdate

And why is a hyphen required in front of the ‘reviewdate’?

Why is the result of:
http://archive.org/search.php?query=-collection%3A%28usfederalcourts%29%20AND%20mediatype%3Atexts&sort=publicdate
different from:
http://archive.org/search.php?query=-collection%3A%28usfederalcourts%29%20AND%20mediatype%3Atexts&sort=-publicdate (this gives the correct result)
Neither of them is ignoring the 'sort' term, because without it the result is different again:
http://archive.org/search.php?query=-collection%3A%28usfederalcourts%29%20AND%20mediatype%3Atexts


(I learnt that while
http://archive.org/search.php?query=bookANDdownloads:[1000%20TO%202000]
does not work,
http://archive.org/search.php?query=book%20AND%20downloads:[1000%20TO%202000]
works.
Apparently, spaces are required around ‘AND’.)

This post was modified by aibek on 2013-01-07 14:04:19

This post was modified by aibek on 2013-01-07 14:07:34

Poster: stbalbach Date: Jan 7, 2013 7:21am
Forum: texts Subject: Re: Searching book on Chinese Lattice Designs

> http://archive.org/search.php?query=war%20gulf^4

The "^4" is odd, not sure what it means. Remove that and it works.

> http://archive.org/search.php?query=buttonwood~%20AND%20sort=-reviewdate

"sort=" can only be used by manually adding it to the end of the URL with a "&". There is no way to sort using the search language, unfortunately.

> http://archive.org/search.php?query=downloads:[10000%20TO%20null]

This one appears to have two problems: the "null" should be "0" (zero), and the range should be in the reverse order, i.e. "[0 TO 10000]".

> sort=-reviewdate
> why is a hyphen required in front of the ‘reviewdate’

The hyphen reverses the sort order, i.e. from newest to oldest. Without the hyphen it would be oldest to newest. The hyphen can also be used to exclude things from searches (very useful). So to exclude the opensource collection from a search:

-collection:opensource
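
To make the distinction concrete, here is a small Python sketch that keeps the query and the sort order as separate URL parameters. The function name `search_url` is hypothetical; the `query` and `sort` parameter names and the base URL are taken from the links in this thread, and nothing here is an official client.

```python
from urllib.parse import quote, urlencode


def search_url(query, sort=None):
    """Build a search.php URL; 'sort' goes after '&', never inside the query."""
    params = {"query": query}
    if sort is not None:
        # A leading hyphen (e.g. "-publicdate") reverses the order.
        params["sort"] = sort
    # quote_via=quote encodes spaces as %20, matching the URLs in this thread.
    return "http://archive.org/search.php?" + urlencode(params, quote_via=quote)


# The hyphen inside the query excludes a collection; the hyphen in 'sort'
# only flips the sort direction.
print(search_url("mediatype:texts AND -collection:opensource", sort="-publicdate"))
```

The two hyphens end up in different parts of the URL, which is why "AND sort=-reviewdate" inside the query string does not work.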

Poster: aibek Date: Jan 7, 2013 6:44pm
Forum: texts Subject: Re: Searching book on Chinese Lattice Designs

Thanks!

So, the only documented features which do not work are the ‘war gulf^4’ and ‘downloads:[10000 TO null]’. (These terms are from the advancedsearch.php page.)

Poster: aibek Date: Jan 6, 2013 3:20am
Forum: texts Subject: Re: Searching book on Chinese Lattice Designs

You are right. Thanks.